Solution for adding context to a text exchange modality during interactions with a composite services application

ABSTRACT

The present invention discloses a solution for handling text exchange input in a composite service environment. The method can included a step of conveying text exchange input to an ACM engine. The ACM engine can map the text exchange input to field and variable values of a component services application. A memory of the composite service environment can be updated using the mapping results. Specifically, field and variable values can be updated that are stored in the memory for a session of the component services application for which the text exchange input was provided.

CROSS-REFERENCE TO RELATED APPLICATIONS

This continuation-in-part application claims the benefit of U.S. patent application Ser. No. 11/297,536 filed Dec. 8, 2005; U.S. patent application Ser. No. 11/296,952 filed Dec. 8, 2005; and, U.S. patent application Ser. No. 11/297,542 filed Dec. 8, 2005, which are hereby incorporated by reference herein.

BACKGROUND

Field of the Invention

The present invention relates to the field of composite services, and, more particularly, to a solution for adding context to a text exchange modality during interactions with an automated application executing in a composite services environment.

Description of the Related Art

A composite services environment permits users to interact with automated applications using different interactive channels, interface types, and modalities. An interactive session of the application can concurrently share information with multiple clients, which can interact using different modalities.

For example, two different users (or a single user) can concurrently participate in a single application session; one interaction occurring via a mobile phone over a voice channel and the other occurring via a browser over a Web (e.g., data) channel. User input provided over the voice channel can cause a dynamic update to browser presented information. For instance the phone user can provide their name and account number to a voice response system. This information can be recorded by the composite services application and placed in a shared memory used by many modalities. When the shared memory is updated, interfaces of every concurrent interface sharing this memory can be dynamically updated. Thus, a Web form can presented in the Web browser can be dynamically updated so that fields associated with a user name and account are filled with content corresponding to the voice input.

Problems interacting with a composite service application can occur when one interactive modality is a free form input modality (i.e., a chat modality) and a different one is tightly constrains or directs input/output. A composite service application needs to translate input entered into the chat interface to data fields of a Web modality or to voice dialog. The problem is generally one of how to derive context from a free form input, such as chat input, and applying that context modified input to concrete data fields, which are shared by other less free-form modalities.

SUMMARY OF THE INVENTION

The present invention utilizes Natural Language Understanding (NLU) runtime components to apply context to free-form input, such as text entered through a text exchange interface. NLU runtime components exist that use application specific statistical models. These models can be referred to as Action Classifier Modules (ACMs). These ACMs map natural language requests from a user to one of many possible routing targets. As traditionally used, ACMs cause an interactive session to branch to a directed dialog form and/or cause a caller to be routed to a human agent that specializes in handling issues of a type that a user is experiencing. For example, a traditional usage of an ACM can automatically direct a call to an electronics department in response to a free form input from a user that is related to electronics.

The present invention uses ACMs in a novel way to apply context to a text exchange interaction with a composite services application. Instead of using NLU statistical models (built from a list of sentences) to route a call, the NLU statistical models can be used to discover what fields in the data store are associated with text-exchange input. Values for these fields are then updated based upon text-exchange content. Listeners associated with other clients, such as Web browser client and/or a voice client, can detect the value updates and can trigger client specific update events. For example, GUI elements of a Web browser linked to the fields can be dynamically updated based upon the text exchange input.

The present invention can be implemented in accordance with numerous aspects consistent with material presented herein. For example, one aspect of the present invention can include a method for handling text exchange input in a composite service environment. The method can included a step of conveying text exchange input to an ACM engine. The ACM engine can map the text exchange input to field and variable values associated with concrete fields of a composite services application. A memory of the composite service environment can be updated using the mapping results. Specifically, variable values can be updated that are stored in the memory for a session of the composite services application for which the text exchange input was provided.

Another aspect of the present invention can include a method for handling text exchange input in a composite services environment. In the method, user provided text exchange input can be received. The text exchange input can be conveyed to a Chatbot software object. The text exchange input can then be processed through a NLU runtime component to identify application specific fields to which the text exchange input is associated. The Chatbot software object can send a context applied message to a model server of the composite services environment. The context applied message can include the identified fields and related value data that was contained in the text exchange input. The model server can update shared data upon receipt of the context applied message.

Still another aspect of the present invention can include a composite services system that includes a data storage area and an ACM engine. The data storage area can include data fields and values for an interactive session of a composite services application. Clients of the composite services application participating in the interactive session can utilize the data storage area to update the values based upon input entered into the clients. The clients can also obtain information from the data fields and values for presentation to a user. The ACM can map in real time free form text input to data fields and values for the interaction session of the composite services application, which applies application specific context to user provided free form input.

It should be noted that various aspects of the invention can be implemented as a program for controlling computing equipment to implement the functions described herein, or a program for enabling computing equipment to perform processes corresponding to the steps disclosed herein. This program may be provided by storing the program in a magnetic disk, an optical disk, a semiconductor memory, or any other recording medium. The program can also be provided as a digitally encoded signal conveyed via a carrier wave. The described program can be a single program or can be implemented as multiple subprograms, each of which interact within a single computing device or interact in a distributed fashion across a network space.

It should also be noted that the methods detailed herein can also be methods performed at least in part by a service agent and/or a machine manipulated by a service agent in response to a service request.

BRIEF DESCRIPTION OF THE DRAWINGS

There are shown in the drawings, embodiments which are presently preferred, it being understood, however, that the invention is not limited to the precise arrangements and instrumentalities shown.

FIG. 1 is a schematic diagram of a system for interacting with a composite services environment in a manner in which application specific context is applied to text exchange input in accordance with an embodiment of the inventive arrangements disclosed herein.

FIG. 2 is a schematic illustration of a composite services environment that uses a Chatbot object to add context to text exchange interactions in accordance with an embodiment of the inventive arrangements disclosed herein.

FIG. 3 is a flow chart of a method for using a Chatbot to convert text exchange input values associated with concrete fields of a converged services application in accordance with an embodiment of the inventive arrangements disclosed herein.

DETAILED DESCRIPTION OF THE INVENTION

FIG. 1 is a schematic diagram of a system 100 for interacting with a composite services environment 150 in a manner in which application specific context is applied to text exchange input in accordance with an embodiment of the inventive arrangements disclosed herein. In system 100, a Chatbot object 130 is used to apply application context to free form input. That is, the Chatbot object 130 determines fields of a composite services application 154 to which free form input relates. More specifically, Natural Language Understanding (NLU) runtime component 140 can use statistical language models to determine applicable fields from free form input. After this determination is made, the Chatbot object 130 can send an information update message to the application 154, which is used to update information in table 158. Table 158 updates can be conveyed to other clients 112-114 accessing composite services application 154.

To illustrate, a user can enter text input into text exchange interface 120, such as “97 Honda's too hot ;-(. Something's wrong. Can you fix it?” The Chatbot Object 130 can run this input through the NLU runtime component 140, which can specifically utilize Action Classifier Module (ACM) engine 142. The NLU component 140 can first determine that the input relates to “automotive repair,” that has a specific application model (stored in data store 144). This model can be associated with table 158, which includes multiple model fields and values, such as customer, make, and model. The NLU component 140 can match input to these fields. For example, Honda can be mapped to a make field. Other fields of table 158 of data store 156 can include, CarYear that can map to 1997, CarPart, CarPartStatus, CarVIN, CarOwner, and the like. After the matches are performed, suitable values of table 158 can be updated. Other interfaces 122-124 can make use of this updated table 158 information. For example, graphical user interface (GUI) element of interface 124 that is bound to a field of table 158 can be updated.

In system 100, different types of devices, such as a text exchange device 110, a communication device 112, and a Web device 114, can each access composite services application 154 executed by application server 152 contained within environment 150 via appropriate interfaces (e.g., text exchange interface 120, voice interface 122, and Web interface 124).

The text exchange interface 120 can include any interface capable of exchanging text in real time, such as a chat interface, an instant messaging interface, and a text messaging interface. The voice interface 112 can be any interface configured for real time voice communications, such as a mobile telephone interface, communication interface of a desktop computer, a Voice over Internet Protocol (VoIP) interface, and the like. The Web interface 120 can be any interface capable of rendering markup or Web based content. For example, the Web interface 120 can be a Web browser.

The NLU runtime component 140 can use statistical models that map natural language requests (i.e., free form input) to one of many possible routing targets. In one embodiment, the statistical models can be specifically designed to handle text interface input, such as emoticons and chat slang. The routing targets of system 100 are table 158 fields. The Chatbot object 130 can be a software object containing programmatic logic designed to relate free form data to concrete model items (e.g., fields and field values of table 158).

Various communicatively linked components 110, 112, 114, 130, 140, 144, 152, and 156 that are illustrated in system 100 can be connected via one or more networks (not shown). The networks can include any hardware/software/and firmware necessary to convey digital content encoded within carrier waves. Content can be contained within analog or digital signals and conveyed through data or voice channels. The networks can include local components and data pathways necessary for communications to be exchanged among computing device components and between integrated device components and peripheral devices. The networks can also include network equipment, such as routers, data lines, hubs, and intermediary servers which together form a packet-based network, such as the Internet or an intranet. The networks can further include circuit-based communication components and mobile communication components, such as telephony switches, modems, cellular communication towers, and the like. The networks can include line based and/or wireless communication pathways.

Each of the data stores 144 and 158 can be a physical or virtual storage spaces configured to store digital content. Data stores 144 and/or data store 158 can be physically implemented within any type of hardware including, but not limited to, a magnetic disk, an optical disk, a semiconductor memory, a digitally encoded plastic memory, a holographic memory, or any other recording medium. Further, each data store 144 and 158 can be a stand-alone storage unit as well as a storage unit formed from a plurality of physical devices. Additionally, content can be stored within data stores 144 and 158 in a variety of manners. For example, content can be stored within a relational database structure or can be stored within one or more files of a file storage system, where each file may or may not be indexed for information searching purposes. Further, the data stores 144 and 158 can utilize one or more encryption mechanisms to protect stored content from unauthorized access.

It should be appreciated that the specific arrangements of system 100 are for illustrative purposes only and that the invention is not to be construed as limited to exact arrangements expressed herein. For example, system 100 shows Chatbot object 130 as external to the composite services environment 150 and shows the NLU runtime component 140 as internal to the environment 150. In one contemplated embodiment (which is graphically illustrated in FIG. 2) the Chatbot object 130 can be a component of environment 150. In another embodiment, the NLU component 140 can be implemented outside environment 150, yet be implemented in a manner so that it is able to share application context information concerning application 154, such as through an API interface.

FIG. 2 is a schematic illustration of a composite services environment 200 that uses a Chatbot object 270 to add context to text exchange interactions in accordance with an embodiment of the inventive arrangements disclosed herein. Environment 200 is one contemplated embodiment for environment 150. In the composite services environment 200, different channels of access to a service can be established for accessing a service through corresponding different modalities of access including voice and visual (e.g., Web and/or Text Exchange) modes. A service can be concurrently accessed by different clients using different communication channels. Additionally, free-form input (i.e., text exchange input) can be routed through a Chatbot object 270, which uses a NLU component 272 to map the input to application specific fields for a suitable application context.

Specifically, interactions with a service within a communication session can be provided across selected ones of the different communication channels, each channel corresponding to a different modality of access to the service. In the case of a voice modality and a visual modality, a separate markup document can be utilized in each selected channel according to the particular modality for that channel.

Importantly, each channel utilized for accessing a service within a session can be associated with each other channel accessing the service within the same session. In consequence, the state of the service—stored within a model in a model-view-controller architecture—can be maintained irrespective of the channel used to change the state of the service. Moreover, the representation of the service can be synchronized in each view for the selected ones of the different channels. As such, an end user can interact with the service in a single session across different channels of access using different modalities of access without requiring burdensome, proprietary logic deployed within a client computing device.

As illustrated, composite services can operate in an application server 275 and can include multiple channel servlets 235 configured to process communicative interactions with corresponding sessions 225 for a composite multimedia service over different channels of access 245, 250, 255 for different endpoint types 260A, 260B, 260C in a communication network. In this regard, the channel servlets 235 can process voice interactions as a voice enabler and voice server to visual endpoint 260A incorporating a voice interface utilizing the Real Time Protocol (RTP) over HTTP, or a voice endpoint 260B utilizing Session Initiation Protocol (SIP). Likewise, the channel servlets 235 can process visual interactions as a Web application to a visual endpoint. As yet another example, the channel servlets 235 can process instant message interactions as an instant messaging server to an instant messaging endpoint 260C.

More specifically, the channel servlets 235 can be enabled to process HTTP requests for interactions with a corresponding session 225 for a composite multimedia service. The HTTP requests can originate from a visual mode oriented Web page over a visual channel 245, from a visual mode oriented text exchange interface over a text exchange channel 255, or even in a voice mode over a voice channel 250 enabled by SIP. When interactions occur over channel 255, Chatbot object 270 can provide context using NLU component 272, as detailed in system 100. The channel servlets 235 can be enabled to process SIP requests for interactions with a corresponding session 225 for a composite multimedia service through a voice enabler which can include suitable voice markup, such as VoiceXML and call control extensible markup language (CCXML) coupled to a SIPlet which, in combination, can be effective in processing voice interactions for the corresponding session 225 for the composite multimedia service, as it is known in the art.

Each of the channel servlets 235 can be coupled to a model servlet 220. The model servlet 220 can mediate interactions with a model 210 for an associated one of the sessions 225. Each of the sessions 225 can be managed within a session manager 221 which can correlate different channels of communication established through the channel servlets 235 with a single corresponding one of the sessions 225. The correlation of the different channels of communication can be facilitated through the use of a coupled location registry 230. The location registry 230 can include a table indicating a host name of systems and channels active for the corresponding one of the sessions 225.

The model servlet 220 can include program code enabled to access a model 210 for a corresponding session 225 for a composite multimedia service providing different channels of access 245, 250, 255 through different endpoints 260A, 260B, 260C. For instance, the model 210 can be encapsulated within an entity bean within a bean container. Moreover, the model 210 can store session data for a corresponding one of the sessions 225 irrespective of the channel of access 245, 250, 255 through which the session data for the corresponding one of the sessions 225 is created, removed or modified.

Notably, changes in state for each of the sessions 225 for a composite multimedia service can be synchronized across the different views 260 for the different channels of access 245, 250, 255 through a listener architecture. The listener architecture can include one or more listeners 240 for each model 210. Each listener can correspond to a different channel of access 245, 250, 255 and can detect changes in state for the model 210. Responsive to detecting changes in state for the model 210 for a corresponding one of the sessions 225 for a composite multimedia service, a listener 240 can provide a notification to subscribing view 260 through a corresponding one of the channel servlets 235 so as to permit the subscribing views 260 to refresh to incorporate the detected changes in state for the model 210.

FIG. 3 is a flow chart of a method 300 for using a Chatbot to convert text exchange input values associated with concrete fields of a composite services application in accordance with an embodiment of the inventive arrangements disclosed herein. The method 300 can be performed in the context of a system 100 or any system that provides converged services.

The method can begin in step 305, where a user can type text into a text exchange client. In step 310, a Chatbot object can receive the text and send it to a NLU runtime component 320 for processing. Specifically, the text input can be matched against entries in a statistical language model 322, which can include application specific grammars as well as text exchange language specific grammars. The Chatbot object can receive results from the NLU runtime component 320, which maps the text exchange input to application specific fields/variables and values. In step 330, the Chatbot can send a message to a composite model server to update the field values based upon the mapped input. In step 335, a Web browser modality can see the field updates, which are automatically presented (i.e., after a browser refresh event) after the update occurs.

To illustrate method 300 with a concrete example, a user can enter “87 Honda ac broke” into a chat interface (step 305). The Chatbot object can receive the text (step 310) and can run the text through a car service NLU runtime component (320), which uses an application specific statistical model (322). The output of the process can be a set of fields and values for a car service application (step 325) (e.g., CarYear=87; CarMake=Honda; CarPart=ac; CarPartStatus=broke). The Chatbot server can send (step 330) a message to the model server to update the output fields (i.e., update fields/values for CarYear, CarMake, CarPart, and CarPartStatus). Other modalities listening for model server updates can receive new values for the fields and can update their views. For example, GUI fields of a Web browser for CarYear, CarMake, CarPart, and CarPartStatus can be automatically updated (step 335), which provides a user with updated information.

The present invention may be realized in hardware, software, or a combination of hardware and software. The present invention may be realized in a centralized fashion in one computer system or in a distributed fashion where different elements are spread across several interconnected computer systems. Any kind of computer system or other apparatus adapted for carrying out the methods described herein is suited. A typical combination of hardware and software may be a general purpose computer system with a computer program that, when being loaded and executed, controls the computer system such that it carries out the methods described herein.

The present invention also may be embedded in a computer program product, which comprises all the features enabling the implementation of the methods described herein, and which when loaded in a computer system is able to carry out these methods. Computer program in the present context means any expression, in any language, code or notation, of a set of instructions intended to cause a system having an information processing capability to perform a particular function either directly or after either or both of the following: a) conversion to another language, code or notation; b) reproduction in a different material form.

This invention may be embodied in other forms without departing from the spirit or essential attributes thereof. Accordingly, reference should be made to the following claims, rather than to the foregoing specification, as indicating the scope of the invention. 

What is claimed is:
 1. A method comprising: receiving a Natural Language Understanding (NLU) statistical model that maps natural language requests to routing targets, a data store organized to define a plurality of fields, and a first record including a plurality of initial field values corresponding to the plurality of fields; displaying, in a web browser, graphical user interface (GUI) elements corresponding to the plurality of fields; receiving text data corresponding to a portion of free-form natural language text, with the text data relating to the first record; determining a plurality of updated field values corresponding to at least some of the plurality of initial field values in the first record, using the received text data and the received NLU statistical model; replacing at least some of the plurality of initial field values in the first record with the determined plurality of updated field values; detecting the updated field values to trigger an update event; and updating at least some of the displayed GUI elements in the web browser based on the determined plurality of updated field values being detected.
 2. The method of claim 1 wherein: the NLU statistical model is a probabilistic model based on patterns present in training data in the form of sentences; and the NLU statistical model is designed and programmed to determine what a user wants to do based on natural language text supplied by the user.
 3. The method of claim 1, wherein: the data store is modality independent; and the data store is accessed in at least the following ways: visually through a web browser and by voice over a telephone connection.
 4. The method of claim 1, wherein the data store includes physical and virtual storage spaces configured to store digital content.
 5. The method of claim 4, wherein the text data includes emoticons and chat slang.
 6. The method of claim 5, wherein the NLU statistical model includes models that map a natural language request to possible routing targets.
 7. The method of claim 6, wherein the first record is an automotive record.
 8. The method of claim 7, wherein the plurality of initial field values includes CarYear, CarOwner, Car Part, Car PartStatus, CarVIN.
 9. The method of claim 1, further comprising running the text data through a car service NLU runtime component.
 10. The method of claim 9, further comprising outputting the plurality of updated field values for a car service application.
 11. The method of claim 1, wherein the determining the plurality of updated field values further comprises using the received NLU statistical model to map the received text data to at least some of the plurality of fields in the data store.
 12. The method of claim 1, wherein the free-form natural language text is received via a first modality.
 13. The method of claim 12, wherein the first record is accessible via the first modality and a second modality.
 14. The method of claim 13, wherein the first modality is a free-form input modality.
 15. The method of claim 14, wherein the second modality constrains input/output to the plurality of fields.
 16. The method of claim 15, wherein the determining the plurality of updated field values further comprises using the received NLU statistical model to map the text data received via the first modality to at least some of the plurality of fields in the data store.
 17. The method of claim 16, further comprising sending a message to update the field values based upon a mapped input of the text data.
 18. The method of claim 17, wherein the detecting the updated field values corresponds to different channels of access through a listener architecture.
 19. The method of claim 18, wherein the listener architecture includes one or more listeners corresponds to the different channels of access for endpoint devices.
 20. The method of claim 19, wherein the endpoint devices are a mobile computing devices. 